Default Prediction for Real Estate Companies with Imbalanced Dataset

Authors

  • Yuan-Xiang Dong
  • Zhi Xiao
  • Xue Xiao
Abstract

In default prediction for real estate companies, non-defaulted cases greatly outnumber defaulted ones, which creates a two-class imbalance problem. This imbalance lowers the ability of prediction models to identify the default samples. To avoid this sample selection bias and improve the prediction models, this paper applies a minority sample generation approach to create new minority samples. Logistic regression, support vector machine (SVM) classification, and neural network (NN) classification trained on the imbalanced dataset serve as benchmarks for the same single prediction models trained on a balanced dataset corrected by the minority sample generation approach. Instead of prediction-oriented tests and overall accuracy, the true positive rate (TPR), the true negative rate (TNR), G-mean, and F-score are used to measure the performance of default prediction models on the imbalanced dataset. An empirical experiment on a sample of 14 default and 315 non-default listed real estate companies in China shows that, in most cases, single prediction models trained on the balanced dataset outperform those trained on the imbalanced dataset.

Keywords: Default prediction, Imbalanced dataset, Real estate listed companies, Minority sample generation approach
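
The abstract's preference for TPR, TNR, G-mean, and F-score over overall accuracy can be illustrated with a minimal sketch. The definitions below are the commonly used ones and are an assumption here, since the abstract does not spell them out: default is the positive class, G-mean is the square root of TPR times TNR, and F-score is the harmonic mean of precision and TPR. The function name `imbalance_metrics` and the "label everything non-default" classifier are hypothetical and used only for illustration; only the 14/315 class counts come from the abstract.

```python
from math import sqrt


def imbalance_metrics(tp, fn, tn, fp):
    """Compute TPR, TNR, G-mean, F-score and accuracy from confusion counts.

    Default is treated as the positive class (an assumption of this sketch).
    Ratios with a zero denominator are reported as 0.0.
    """
    tpr = tp / (tp + fn) if (tp + fn) else 0.0        # defaults correctly flagged
    tnr = tn / (tn + fp) if (tn + fp) else 0.0        # non-defaults correctly cleared
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    g_mean = sqrt(tpr * tnr)
    f_score = (
        2 * precision * tpr / (precision + tpr) if (precision + tpr) else 0.0
    )
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return {
        "tpr": tpr,
        "tnr": tnr,
        "g_mean": g_mean,
        "f_score": f_score,
        "accuracy": accuracy,
    }


# Hypothetical classifier that labels all 329 companies as non-default:
# it misses all 14 defaults and clears all 315 non-defaults.
trivial = imbalance_metrics(tp=0, fn=14, tn=315, fp=0)
print(trivial)
```

With these counts the overall accuracy is 315/329, roughly 0.957, while TPR, G-mean, and F-score are all 0. A model that never detects a default therefore looks strong on accuracy alone, which is why the imbalance-aware measures are the more informative ones for this dataset.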

Similar resources

Estimation of Default Risk Based on KMV Model—An Empirical Study for Chinese Real Estate Companies

In this paper, we analyze the default risk of Chinese real estate companies with the KMV model and a time-varying copula. We collected data on the real estate companies listed on the Shanghai and Shenzhen Exchanges from 2007 to 2012 to calculate the default distance and correlations. Experimental results show that the default risk increases during the financial crisis. Moreover, results also indicate ...

Discussant remarks on Andrew Felton and Joseph B Nichols' paper "Commercial real estate loan performance at failed US banks"

This is a very nice paper, well motivated and based on a unique dataset. Using loan-level data from community banks entering Federal Deposit Insurance Corporation (FDIC) receivership, the paper estimates the probability of default (PD) and loss-given default (LGD) of two distinct types of commercial real estate (CRE) loans. The authors also compare commercial real estate loans in community bank...

Investigating the missing data effect on credit scoring rule based models: The case of an Iranian bank

Credit risk management is a process in which banks estimate the probability of default (PD) for each loan applicant. Data sets of previous loan applicants are built by gathering their data; these internal data sets are usually completed with external credit bureau data and finally used for estimating PD in banks. There is also a continuous interest for banks to use rule based classifiers to b...

Finding Default Barrier and Optimal Cutoff Rate in KMV Structural Model based on the best Ranking of Companies

Given the adverse consequences that financial distress brings for companies, the economy, and financial-monetary institutions, methods that can predict the occurrence of financial failure and prevent the loss of wealth are of great importance. The major models of credit risk assessment are based on retrospective information and using the methods which use the updated market...

Comparing Prediction Power of Artificial Neural Networks Compound Models in Predicting Credit Default Swap Prices through Black–Scholes–Merton Model

Default risk is one of the most important types of risks, and credit default swap (CDS) is one of the most effective financial instruments to cover such risks. The lack of these instruments may reduce investment attraction, particularly for international investors, and impose potential losses on the economy of the countries lacking such financial instruments, among them, Iran. After the 2007 fi...

Journal:
  • JIPS

Volume: 10

Publication date: 2014